September 1982 ACM Computing Surveys (CSUR), volume 14 issue 3 
5 Adapting content to mobile devices: Fractal summarization for mobile devices to 
access large documents on the web 
Christopher C. Yang, Fu Lee Wang 

May 2003 Proceedings of the twelfth international conference on World Wide Web 

Full text available: pdf(317.55 KB) Additional Information: full citation, abstract, references, index terms 

Wireless access with mobile (or handheld) devices is a promising addition to the WWW and 
traditional electronic business. Mobile devices provide convenience and portable access to 
the huge information space on the Internet without requiring users to be stationary with 
network connection. However, the limited screen size, narrow network bandwidth, small 
memory capacity and low computing power are the shortcomings of handheld devices. 
Loading and visualizing large documents on handheld devices bec ... 

Keywords: document summarization, fractal summarization, handheld devices, mobile 
commerce 
October 1985 ACM Transactions on Information Systems (TOIS), volume 3 issue 4 
The formalization of the architecture of documents and text formatting are the central 
issues of this paper. Besides a fundamental and theoretical approach toward these topics, 
an overview is presented of the COBATEF system. The COBATEF system is a context-based 
text formatting system, for which a software, as well as a hardware, implementation is 
available. A unique feature of the system is its automatic text-element recognition 
mechanism, which is context based and consequently ... 
Automatic abstracting and indexing—survey and recommendations 
H. P. Edmundson, R. E. Wyllys 

May 1961 Communications of the ACM, volume 4 issue 5 

Full text available: pdf(1.04 MB) Additional Information: full citation, abstract, references, citings, index terms 

In preparation for the widespread use of automatic scanners which will read documents and 
transmit their contents to other machines for analysis, this report presents a new concept in 
automatic analysis: the relative-frequency approach to measuring the significance of words, 
word groups, and sentences. The relative-frequency approach is discussed in detail, as is its 
application to problems of automatic indexing and automatic abstracting. Included in the 
report is a summary of automatic ana ... 

8 The automatic generation of literature abstracts: an approach based on the 
identification of self-indicating phrases 
C. D. Paice 

June 1980 Proceedings of the 3rd annual ACM conference on Research and 
development in information retrieval 

Full text available: pdf(1.04 MB) Additional Information: full citation, references, citings 



Passage-level evidence in document retrieval 
James P. Callan 




August 1994 Proceedings of the 17th annual international ACM SIGIR conference on 
Research and development in information retrieval 
10 Automatic phrase indexing for document retrieval 
J. Fagan 

November 1987 Proceedings of the 10th annual international ACM SIGIR conference on 
Research and development in information retrieval 

Full text available: pdf(1.10 MB) Additional Information: full citation, abstract, references, citings, index terms 

An automatic phrase indexing method based on the term discrimination model is described, 
and the results of retrieval experiments on five document collections are presented. 
Problems related to this non-syntactic phrase construction method are discussed, and some 
possible solutions are proposed that make use of information about the syntactic structure 
of document and query texts. 

11 Special issue on natural language generation: Generating natural language summaries 
from multiple on-line sources 

Dragomir R. Radev, Kathleen R. McKeown 

September 1998 Computational Linguistics, Volume 24 issue 3 

Full text available: pdf Additional Information: full citation, abstract, references, citings 
Publisher Site 

We present a methodology for summarization of news about current events in the form of 
briefings that include appropriate background (historical) information. The system that we 
developed, SUMMONS, uses the output of systems developed for the DARPA Message 
Understanding Conferences to generate summaries of multiple documents on the same or 
related events, presenting similarities and differences, contradictions, and generalizations 
among sources of information. We describe the various components ... 

12 Multi-paragraph segmentation of expository text 
Marti A. Hearst 

June 1994 Proceedings of the 32nd conference on Association for Computational 
Linguistics 

Full text available: pdf(772.92 KB) Additional Information: full citation, abstract, references, citings 
Publisher Site 

This paper describes TextTiling, an algorithm for partitioning expository texts into coherent 
multi-paragraph discourse units which reflect the subtopic structure of the texts. The 
algorithm uses domain-independent lexical frequency and distribution information to 
recognize the interactions of multiple simultaneous themes. Two fully-implemented versions 
of the algorithm are described and shown to produce segmentation that corresponds well to 
human judgments of the major subtopic boundaries of th ... 

13 Subtopic structuring for full-length document access 
Marti A. Hearst, Christian Plaunt 

July 1993 Proceedings of the 16th annual international ACM SIGIR conference on 
Research and development in information retrieval 

Full text available: pdf(1.02 MB) Additional Information: full citation, abstract, references, citings, index terms 

We argue that the advent of large volumes of full-length text, as opposed to short texts like 
abstracts and newswire, should be accompanied by corresponding new approaches to 
information access. Toward this end, we discuss the merits of imposing structure on full- 
length text documents; that is, a partition of the text into coherent multi-paragraph units 
that represent the pattern of subtopics that comprise the text. Using this structure, we can 
make a distinction between th ... 
14 Selective text utilization and text traversal 
Gerard Salton, James Allen 

December 1993 Proceedings of the fifth ACM conference on Hypertext 

Full text available: pdf(1.20 MB) Additional Information: full citation, references, citings, index terms 



Keywords: automatic text linking, full-text access, global text comparisons, information 
retrieval, local context checking, passage retrieval, selective text reading, text analysis, text 
summarization 



15 Sequential thematic organization of publications: how to achieve coherence in 
proposals and reports 

J. R. Tracey, D. E. Rugh, W. S. Starkey 

August 1999 ACM SIGDOC Asterisk Journal of Computer Documentation, volume 23 issue 3 
Full text available: pdf(3.80 MB) Additional Information: full citation, index terms 



16 Multimedia document presentation, information extraction, and document formation in 
MINOS: a model and a system 

S. Christodoulakis, M. Theodoridou, F. Ho, M. Papa, A. Pathria 

December 1986 ACM Transactions on Information Systems (TOIS), Volume 4 issue 4 

Full text available: pdf(3.16 MB) Additional Information: full citation, abstract, references, citings, index terms, review 

MINOS is an object-oriented multimedia information system that provides integrated 
facilities for creating and managing complex multimedia objects. In this paper the model for 
multimedia documents supported by MINOS and its implementation is described. Described 
in particular are functions provided in MINOS that exploit the capabilities of a modern 
workstation equipped with image and voice input-output devices to accomplish an active 
multimedia document presentation and browsing within docu ... 

17 Technical papers: Towards topic-based summarization for interactive document 
viewing 

Achim Hoffmann, Son Bao Pham 

October 2003 Proceedings of the international conference on Knowledge capture 

Full text available: pdf(120.31 KB) Additional Information: full citation, abstract, references, index terms 

Our research aims at interactive document viewers that can select and highlight relevant 
text passages on demand. Another related objective is the generation of topic-specific 
summaries of texts as opposed to general purpose summaries. This paper introduces our 
notions of discourse structure tree and level-of-detail tree. Both structures are used to 
represent relevant aspects of a text segment for the above mentioned purposes. 
Furthermore, we introduce a Knowledge Acquisition Frame ... 

Keywords: knowledge acquisition, natural language processing 



18 A flexible object merging framework 
Jonathan P. Munson, Prasun Dewan 

October 1994 Proceedings of the 1994 ACM conference on Computer supported 
cooperative work 

Full text available: pdf(1.40 MB) Additional Information: full citation, abstract, references, citings, index terms 

The need to merge different versions of an object to a common state arises in collaborative 
computing due to several reasons including optimistic concurrency control, asynchronous 
coupling, and absence of access control. We have developed a flexible object merging 
framework that allows definition of the merge policy based on the particular application and 
the context of the collaborative activity. It performs automatic, semi-automatic, and 
interactive merges, supports semantics-determined m ... 

Keywords: diff, flexible coupling, merging, optimistic concurrency control, undo, versions 

19 Text Categorization: Topic difference factor extraction between two document sets and 
its application to text categorization 

Takahiko Kawatani 

August 2002 Proceedings of the 25th annual international ACM SIGIR conference on 
Research and development in information retrieval 

Full text available: pdf(249.49 KB) Additional Information: full citation, abstract, references, index terms 

To improve performance in text categorization, it is important to extract distinctive features 
for each class. This paper proposes topic difference factor analysis (TDFA) as a method to 
extract projection axes that reflect topic differences between two document sets. Suppose 
all sentence vectors that compose each document are projected onto projection axes. TDFA 
obtains the axes that maximize the ratio between the document sets as to the sum of 
squared projections by solving a generalized eigenv ... 

20 Fast detection of communication patterns in distributed executions 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies 
on Collaborative research 

Full text available: pdf(4.21 MB) Additional Information: full citation, abstract, references, index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based on 
process-time diagrams are often used to obtain a better understanding of the execution of 
the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not provide 
the user with the desired overview of the application. In our experience, such tools display 
repeated occurrences of non-trivial commun ... 
21 aTool: creating validated XML documents on the fly using MS word 
Oliver Meyer 

October 2002 Proceedings of the 20th annual international conference on Computer 
documentation 

Full text available: pdf(239.02 KB) Additional Information: full citation, abstract, references, index terms 

This paper describes aTool, an extension to Microsoft's Word to create XML documents. 
aTool has been developed in a joint project of the publisher Springer Verlag, Technical 
University of Munich (TUM), and Technical University of Aachen (RWTH). It has been 
developed to provide Springer Verlag with uniform XML documents from its authors and has 
become a generic XML creation tool that can be adapted to different document 
structures. For an author, aTool derives XML structures from MS Word editing c ... 

Keywords: A-Posteriori Integration, DOM, Microsoft Office, Microsoft Word, Microsoft Word 
Add-In, XML, character formatting 



22 Documentation production from a formal database 
Christopher Hartsough, Yuzo Yamamoto, E. David Callender 

January 1982 Proceedings of the 1st annual international conference on Systems 
documentation 

Full text available: pdf(899.18 KB) Additional Information: full citation, abstract, references, citings, index terms 

This paper reports on an existing, operational prototype system, TG/TF2, for the generation 
of typeset quality documentation from a formal database. TG/TF2 directly supports the 
conceptual separation of system design, document content design, and document format 
design. Specifically, support for system design is supplied by Problem Statement 
Language/Problem Statement Analyzer (PSL/PSA), a development of the ISDOS Project at 
the University of Michigan. Document content design support is pro ... 

23 Interactive Editing Systems: Part II 
Norman Meyrowitz, Andries van Dam 

September 1982 ACM Computing Surveys (CSUR), Volume 14 issue 3 

Full text available: pdf(9.17 MB) Additional Information: full citation, references, citings, index terms 



24 The evaluation of text editors: methodology and empirical results. 
Teresa L. Roberts, Thomas P. Moran 
April 1983 Communications of the ACM, volume 26 issue 4 

Full text available: pdf(2.24 MB) Additional Information: full citation, abstract, references, citings, index terms, review 

This paper presents a methodology for evaluating text editors on several dimensions: the 
time it takes experts to perform basic editing tasks, the time experts spend making and 
correcting errors, the rate at which novices learn to perform basic editing tasks, and the 
functionality of editors over more complex tasks. Time, errors, and learning are measured 
experimentally; functionality is measured analytically; time is also calculated analytically. 
The methodology has thus far been u ... 

Keywords: ergonomics, human factors, human-computer interaction, human-computer 
interface, system design, system evaluation, text editing, user model, user performance, 
user psychology 



25 Computer Evaluation of Indexing and Text Processing 
G. Salton, M. E. Lesk 

January 1968 Journal of the ACM (JACM), volume 15 issue 1 

Full text available. Additional Information: full citation, abstract, references, citings, index terms 

Automatic indexing methods are evaluated and design criteria for modern information 
systems are derived. 

26 Beyond boolean search: FLEXICON, a legal text-based intelligent system 
Daphne Gelbart, J. C. Smith 

May 1991 Proceedings of the third international conference on Artificial intelligence 
and law 

Full text available: pdf(955.73 KB) Additional Information: full citation, references, citings, index terms 



27 Interactive Editing Systems: Part I 
Norman Meyrowitz, Andries van Dam 

September 1982 ACM Computing Surveys (CSUR), Volume 14 issue 3 

Full text available: pdf(3.08 MB) Additional Information: full citation, citings, index terms 



28 Improving the effectiveness of information retrieval with local context analysis 
Jinxi Xu, W. Bruce Croft 

January 2000 ACM Transactions on Information Systems (TOIS), volume 18 issue 1 

Full text available: pdf(193.02 KB) Additional Information: full citation, abstract, references, citings, index terms, review 

Techniques for automatic query expansion have been extensively studied in information 
research as a means of addressing the word mismatch between queries and documents. 
These techniques can be categorized as either global or local. While global techniques rely 
on analysis of a whole collection to discover word relationships, local techniques emphasize 
analysis of the top-ranked documents retrieved for a query. While local techniques have 
shown to be more effective than global techniques in ... 

Keywords: cooccurrence, document analysis, feedback, global techniques, information 
retrieval, local context analysis, local techniques 
M. Cecelia Buchanan, Polle T. Zellweger 
Keywords: automatic scheduling, automatic temporal layout, media synchronization, 
multimedia document formatting, temporal constraints 



30 Automatic categorization of case law 
Paul Thompson 

May 2001 Proceedings of the 8th international conference on Artificial intelligence 
and law 

Full text available: pdf(84.81 KB) Additional Information: full citation, abstract, references, citings, index terms 

This paper describes a series of automatic text categorization experiments with case law 
documents. Cases are categorized into 40 broad, high-level categories. These results are 
compared to an existing operational process using Boolean queries manually constructed by 
domain experts. In this categorization process recall is considered more important than 
precision. This paper investigates three algorithms that potentially could automate this 
categorization process: 1) a nearest neighbor-like a ... 

Keywords: text categorization 



31 A comparison of classifiers and document representations for the routing problem 
Hinrich Schutze, David A. Hull, Jan O. Pedersen 

July 1995 Proceedings of the 18th annual international ACM SIGIR conference on 
Research and development in information retrieval 

Full text available: pdf(1.16 MB) Additional Information: full citation, references, citings, index terms 



32 Incremental document formatting 

Pehong Chen, Michael A. Harrison, Ikuo Minakata 

January 2000 Proceedings of the ACM conference on Document processing systems 

Full text available: pdf(631.10 KB) Additional Information: full citation, references, index terms 



33 A structural view of the Cedar programming environment 

Daniel C. Swinehart, Polle T. Zellweger, Richard J. Beach, Robert B. Hagmann 

August 1986 ACM Transactions on Programming Languages and Systems (TOPLAS), 
Volume 8 Issue 4 

Full text available: pdf(6.32 MB) Additional Information: full citation, abstract, references, citings, index terms 

This paper presents an overview of the Cedar programming environment, focusing on its 
overall structure— that is, the major components of Cedar and the way they are organized. 
Cedar supports the development of programs written in a single programming language, 
also called Cedar. Its primary purpose is to increase the productivity of programmers whose 
activities include experimental programming and the development of prototype software 
systems for a high-performance personal computer. T ... 

34 Spoken dialogue technology: enabling the conversational user interface 
March 2002 ACM Computing Surveys (CSUR), Volume 34 issue 1 

Full text available: pdf(987.69 KB) Additional Information: full citation, abstract, references, citings, index terms, review 

Spoken dialogue systems allow users to interact with computer-based applications such as 
databases and expert systems by using natural spoken language. The origins of spoken 
dialogue systems can be traced back to Artificial Intelligence research in the 1950s 
concerned with developing conversational interfaces. However, it is only within the last 
decade or so, with major advances in speech technology, that large-scale working systems 
have been developed and, in some cases, introduced into commerc ... 

Keywords: Dialogue management, human computer interaction, language generation, 
language understanding, speech recognition, speech synthesis 



35 Efficient web browsing on handheld devices using page and form summarization 
January 2002 ACM Transactions on Information Systems (TOIS), Volume 20 issue 1 

Full text available: pdf(4.47 MB) Additional Information: full citation, abstract, references, citings, index terms, review 

We present a design and implementation for displaying and manipulating HTML pages on 
small handheld devices such as personal digital assistants (PDAs), or cellular phones. We 
introduce methods for summarizing parts of Web pages and HTML forms. Each Web page is 
broken into text units that can each be hidden, partially displayed, made fully visible, or 
summarized. A variety of methods are introduced that summarize the text units. In 
addition, HTML forms are also summarized by displaying just the t ... 

Keywords: PDA, Personal digital assistant, WAP, WML, forms, handheld computers, mobile 
computing, summarization, ubiquitous computing, wireless computing 



36 Pen computing: a technology overview and a vision 
Andre Meyer 

July 1995 ACM SIGCHI Bulletin, Volume 27 Issue 3 

Full text available: pdf(5.14 MB) Additional Information: full citation, abstract, citings, index terms 

This work gives an overview of a new technology that is attracting growing interest in public 
as well as in the computer industry itself. The visible difference from other technologies is in 
the use of a pen or pencil as the primary means of interaction between a user and a 
machine, picking up the familiar pen and paper interface metaphor. From this follows a set 
of consequences that will be analyzed and put into context with other emerging 
technologies and visions. Starting with a short historic ... 

37 Summarization: The use of unlabeled data to improve supervised learning for text 
summarization 

Massih-Reza Amini, Patrick Gallinari 

August 2002 Proceedings of the 25th annual international ACM SIGIR conference on 
Research and development in information retrieval 

Full text available: pdf(276.81 KB) Additional Information: full citation, abstract, references, index terms 

With the huge amount of information available electronically, there is an increasing demand 
for automatic text summarization systems. The use of machine learning techniques for this 
task allows one to adapt summaries to the user needs and to the corpus characteristics. 
These desirable properties have motivated an increasing amount of work in this field over 
the last few years. Most approaches attempt to generate summaries by extracting sentence 
segments and adopt the supervised learning paradigm ... 

Keywords: machine learning, semi-supervised learning, text summarization, text-span 
extraction 
Volume 12 Issue 4 

Full text available: pdf(3.55 MB) Additional Information: full citation, abstract, references, citings, index terms 

In traditional interactive programming environments, each application individually manages 
its interaction with the human user. The result is duplication of effort in implementing user 
interface code and nonuniform— hence confusing— input conventions. This paper presents 
an approach to support automatic generation of user interfaces in environments based on 
algebraic languages. The approach supports the editing model of interaction, which allows a 
user to view all appli ... 
Queries to text collections are resolved by ranking the documents in the collection and 
returning the highest-scoring documents to the user. An alternative retrieval method is to 
rank passages, that is, short fragments of documents, a strategy that can improve 
effectiveness and identify relevant material in documents that are too large for users to 
consider as a whole. However, ranking of passages can considerably increase retrieval 
costs. In this article we explore alternative query evalua ... 